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(54) Method of processing motion vectors histograms to detect interleaved or progressive 
pict u re structu res 



(57) A method of processing a bitstream of coded 
data of video sequences of progressive or interlaced 
pictures divisible in atop half -frame and in a bottom half- 
frame that comprises estimating motion vectors of 
groups of pixels belonging to said top half-frame of the 
current picture in relation to pixels belonging to the bot- 
tom half-frame of the preceding picture (MV TOP ^ Y (K)) 
and motion vectors of group of pixels of said bottom half- 
frame of the current picture in relation to pixels belong- 
ing to the top half-frame of the current picture (MV BOTxY 
(K)), respectively, characterized in that it comprises cal- 
culating for each macroblock of said top half-frame and 
a bottom half-frame a respective top motion coefficient 
( v top X| y( k )) and a D °ttorn motion coefficient (V BOT ^ Y 
(K)), depending on the estimation of the motion vectors 
of top half-frame (MV TOPx Y (K)) and bottom half-frame 
(MV BOTx Y (K)); recognizing the current picture as an in- 
terlaced picture by a substantial equality of the distribu- 
tions of values of said motion coefficients (V TOP ^ Y (K) ( 
V BOTx y (K)) or as a progressive picture by a substantial 
inequality of the distributions of values of said motion 
coefficients (V TOPx y (K), V bo1Xy (K)). 
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Description 

FIELD OF THE INVENTION 

[0001] The present Invention generally relates to methods for processing digital video pictures and in particular a 
method for detecting properties of a sequence of digital pictures. 

BACKGROUND OF THE INVENTION 

[0002] The increasing requirements of TV sets, in terms of quality of the displayed images, and of transmitting de- 
vices, in terms of amplitude of the frequency bands used during transmission, have made the techniques fortransmitting 
video sequences of crucial importance. 

[0003] Use of digital transceiving systems has made possible to split a frame of a picture in a matrix of numerical 
values representing the intensities of the video components of each pixel of a frame, allowing processing of the picture 
by computers. 

[0004] The possibility of treating a picture as a matrix of numbers, and so as a bitstream, offers the possibility of 
coding pictures to facilitate transmission and improve the quality of displayed images. . 

[0005] Systems for coding video digital sequences are based on the recognition of several properties of pictures. 
Among such properties, a particular importance have the detection of the character progressive or interlaced of the 
pictures and the detection of the motion of the objects. 

[0006] When these properties are known, it is possible to compress the picture making most efficient their transmis- 
sion, in terms of used frequency band, and most satisfactory, in terms of quality of their display. 
[0007] Several techniques of motion estimation and detection of the progressive or interlaced content (IPD: Interlaced 
Progressive Detection) of digital video sequences are known in literature. 

[0008] Such techniques are disclosed in US patent No. 5,734,420, WO 99/20040 and in EP-A-0917363, EP-A- 
0944245, EP-A-0979011, EP99830545.2, EP98830689.0 in the name of the same applicant. 

[0009] The methods disclosed in the above mentioned documents operate on the bit values representing the video 
components of single pixels, so requiring a computational complexity that makes them inappropriate for low cost ap- 
plications. 

[0010] In fact a frame, according to the PAL standard, is composed of 720*288=207360 pixels. Methods operating 
on each pixel are rather burdensome or impose simplifications that reduce the quality of the displayed picture in a more 
or less sensible manner. 

[0011] Methods of elaborating a bitstream of video digital sequences, operating on groups of pixels (macroblocks) 
in order to reduce the computation complexity are generally used for low cost applications. 

[0012] More precisely, need is felt for a method of elaborating data of digital video sequences that while operating 
on groups of pixels: 

• permits recognition of the processed picture as progressive or interlaced picture with a reduced number of calcu- 
lations; 

• reduces the amount of calculations motion estimation, though preserving quality and efficiency of the compression; 
or 

for the same compression efficiency and amount of calculations during the motion estimation, enhances the quality 
of displayed pictures. 

OBJECT AND SUMMARY OF THE INVENTION 

[001 3] It has been found and is the object of the present application a processing method usable with MPEG standard 
coded video sequences, requiring the performance of calculations only on motion vectors. Given that motion vectors 
can be defined even for a macroblock : instead of being defined for each single pixel, a great computational simplification 
may be achieved. 

[0014] The method of the invention allows the detection of the progressive or interlaced content of a picture for 
improving the effectiveness of the coding of video sequences and the effectiveness of the filtering that is made on the 
chrominance component of pictures input the coder. The method allows for an enhanced precision in the calculation 
of motion vectors by virtue of a pre-recognition of the processed picture as a progressive or interlaced picture. 
[001 5] Object of the invention is a method of processing a bitstream of coded data of video sequences of progressive 
or interlaced pictures, divisible in a top half-frame and in a bottom half-frame, that comprises estimating motion vectors 
of groups of pixels belonging to the top half-frame of the current picture in relation to pixels belonging to the bottom 
half-frame of the preceding picture and motion vectors of groups of pixels of the bottom half-frame of the current picture 
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in relation pixels belonging to the top half-frame of the current picture. 

[0016] Starting from such estimations, respective top and a bottom motion coefficients depending on the estimation 
of the motion vectors of the top and bottom half-frame are calculated for each macroblock of the top half-frame and 
bottom half-frame, the current picture is recognized as a progressive or an interlaced picture in function of the calculated 
top and bottom motion coefficients. A substantial equality of the distributions of values of motion coefficients pointing 
to an interlaced content by contrast, a substantial inequality of the distribution of values of said coefficients pointing to 
a progressive content. 

[0017] The recognition can be done by calculating a pair of shape coefficients representing the distributions of the 
top and bottom motion coefficients, recognizing the current picture as progressive or as interlaced depending on wheth- 
er the shape coefficients differ by a quantity greater or lower than a certain value. 

[0018] According to an alternative embodiment, it is possible to distinguish progressive pictures by from interlaced 
pictures calculating a coefficient representing the stochastic correlation between the distribution of the top and bottom 
motion coefficients, and recognizing the current picture as an interlaced picture or as a progressive picture depending 
on whether the calculated coefficient exceeds a certain value or not. 
[0019] Preferably recognition is established by: 

comparing the top and bottom coefficients with a pre-established top threshold and a bottom threshold, respec- 
tively; 

counting the number of motion vectors whose motion coefficients are lower than the top threshold of the top and 
the bottom frame of the current picture, producing a pair of respective coefficients; 

counting the number of motion vectors whose motion coefficients are greater than the bottom threshold of the top 
and bottom half -frame of the current picture, producing a second pair of respective coefficients; 
recognizing the current picture as a progressive or interlaced picture depending on the four coefficients relative to 
the current picture and to preceding pictures. 

[0020] The method of the invention may also comprise the following further steps: 

for each picture, calculating a temporary weight value as a function of the result of the recognition of the picture 
as a progressive or interlaced picture carried out according to the above mentioned method; 
calculating a final weight value as a function of the temporary weight value relative to the picture and of the final 
weight values relative to preceding pictures; 

recognizing a current picture as a progressive or an interlaced picture depending on its temporary weight value 
and on the final weight values relative to preceding pictures. 

[0021] These methods may be implemented even to get a refined calculation of the motion vectors of a picture of a 
video sequence by a Frame- Prediction technique if the current picture is recognized as a progressive picture or by a 
Field-Prediction technique if the current picture is recognized as an interlaced picture. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0022] The different aspects and advantages of the invention will result even more evident through the description 
of several embodiments of the invention and by referring to the attached drawings, wherein: 

Figure 1 is an example of a video sequence taken with a progressive camera; 
Figure 2 is an example of a video sequence taken with an interlaced camera; 

Figure 3 is an example of a possible distribution of the motion coefficients of the motion vectors of a top half-frame 
(top field) and of a bottom half-frame (bottom field) of a progressive sequence; 

Figure 4 is an example of a possible distribution of the motion coefficients of motion vectors of a top half-frame 

(top field) and of a bottom half-frame (bottom field) of an interlaced sequence; 

Figure 5 is a block diagram for executing an algorithm that calculates comparisons coefficients; 

Figure 6a is the flow chart of the IPD procedure of the invention; 

Figure 6b is a detailed flowchart of the functional block "Hysteresis" of Fig. 6a; 

Figure 6c is a flowchart of the functional block "Quasi-stationary" of Fig. 6a; 

Figure 7 is the scheme of detection of the progressive or interlaced content of pictures; 

Figure 8 is a block diagram of a generic two step motion estimator employing the IPD step of the invention. 
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DESCRIPTION OF SEVERAL EMBODIMENTS OF THE INVENTION 



[0023] The method of the invention allows the recognition of pictures of digital video sequences as interlaced or 
progressive pictures irrespectively of the standards used for coding, exclusively depending on previously performed 
5 estimations of motion vectors relative to groups of pixels. The method can be exploited to improve the calculation of 
motion vectors in terms of augmented precision and/or in terms of reduced computational complexity. 
[0024] The fundamental approach of the present invention consists in that by always carrying out motion estimation 
through a comparison of parameters of consecutive half-frames 1 it is possible to verify whether the pictures to code 
are interlaced or progressive. 

to [0025] in fact, in case of a progressive sequence, if the motion estimation is carried out on the bottom field with 
reference to the top field belonging to the same frame, the modulus of the resulting motion vectors will tend to zero. 
This happens because the two fields have been sampled by a progressive camera at the same time instant (in fact 
they belong to the same frame), and not at an interval of 1/50 (or of 1/60) of a second, as in the case of an interlaced 
PAL (or NTSC) camera. 

is [0026] Moreover, is very probable that the vertical component of motion vectors be comprised between -1 and 1, 
because of the vertical difference of a line between fields of different parity, while the horizontal component be practically 
zero. It is always possible that zones with an elevated vertical detail (spatial frequencies) deceive the estimator by 
having it estimating a non-existent vertical motion. However it is supposed to have a robust motion estimator not 
affected by this problem. 

20 [0027] By contrast, motion estimation between the top field of frame k and the previous bottom field of the frame k- 
1, will detect an effective motion not only in an interlaced sequence but also in an eventual progressive sequence, 
given that the two fields are sampled at different instants (and in fact they belong to different frames). 
[0028] Figures 1 and 2 illustrate this concept: in particular Fig. 1 shows the motion of a ball taken by a progressive 
camera, and Fig. 2 the same scene taken by an interlaced camera. As can be seen, while from a field to the successive 

25 of the sequence of Fig. 2 the motion vectors M V are always different from zero, this is not true for the motion vectors 
of the sequence of Fig. 1 . In fact, the motion vectors MV from a bottom field to a top field of the same frame k or k+ 1 
are always zero, while the motion vectors MV from a top field belonging to the frame k+ 1 to the bottom field belonging 
to the frame k are always different from zero. 

[0029] The criterion of the invention for distinguishing in a very easy manner interlaced pictures from progressive 

30 pictures is based on such a peculiarity. 

[0030] Let us indicate as MV TOPX Y (K) the motion vectors of the k-th top half -frame with respect to the (/c- 7)-th bottom 
half-frame and M V BOTX Y (K) the motion vectors of the /c-th bottom half-frame with respect to the k-th top half-frame, 
where k is the frame number. Such motion vectors MV =(V x ,V y ) may be relative to single pixels or to groups of pixels 
(macroblocks) of the /c-th picture, belonging to the top half -frame or the bottom half -frame, respectively. 

35 [0031] In order to'implement the method of the invention it is necessary to get, by estimating, even coarsely, the 
motion vectors MV TOPX Y (K) and MV BOTX Y (K), as many respective motion coefficients V TOPX Y (K) and V BOTX Y (K) and 
to recognize the current picture as a progressive or interlaced picture depending on such coefficients. 
[0032] A preferred way of calculating the motion coefficients V TOPX Y (K) and V BOTXY (K) is to calculate for each 
motion vector MV the sum of the absolute value of its components, i.e. to make 

40 

V T OPX.Y(K)=IV x l + IV y l 

by considering the motion vectors MV TOPX Y (K) of the top half-frame of the /c-th picture and 

45 

V B OTX.Y(KHV x l+IV y l 

by considering the motion vectors MV BOTX Y (K) of the bottom half-frame of the /c-th picture. 
50 [0033] Indeed the above way of calculating the motion coefficients is only one of several possible ones: for example 
it would be satisfactory to calculate the motion coefficients as the square root of the sum of squares of V x and V y , as 
well as in other ways, as will appear evident to a skilled person. 

[0034] By way of an example, the case wherein the motion coefficients V TOPX Y (K) e V BOTX Y (K) are calculated as 
the sum of the absolute values of the components of the motion vector MV will be described hereinbelow, though the 
55 method of the invention may be implemented even by calculating motion coefficients in a different manner. 

[0035] It is possible to distinguish an interlaced picture from a progressive picture by examining the distribution of 

1 A frame of a picture is divisible in two half-frames {fields), commonly called top (or odd) field and bottom (or even) field, because of the position of the 
lines constituting the frame. 
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the motion coefficients of its top half -frame and of its bottom half -frame. Experimentally, it has been found that pro- 
gressive pictures are characterized by notable differences in the distribution of top and bottom coefficients, while in- 
terlaced pictures are characterized by distinctly more uniform distributions of motion coefficients. 
[0036] This concept is clearly shown in Figs. 3 and 4 wherein two typical hystograms of the respective distributions 
5 of motion coefficients of a top half-frame (top field) and of a bottom half-frame (bottom field), relative to progressive 
sequences and interlaced sequences, respectively, are depicted. 

[0037] in particular, in Fig. 3 the difference between the two distributions is evident. The distribution of coefficients 
of the bottom field is characterized by a greater concentration (vertical axis) near the value 1 (horizontal axis), while 
the distribution of coefficients of the top field has a greater uniformity; on the contrary in Fig. 4 the two hystograms are 
io almost identical. 

[0038] Upon testing many different video sequences, calculating the respective distributions of motion coefficients, 
obtained with different motion estimators, it has been seen that: 

• the distributions of motion coefficients of bottom fields belonging to frames of movie sub-sequences (film-mode) 
15 show a distinct concentration about the maximum value, near to 1 , because of the vertical component of vectors. 

On the contrary in top fields such a maximum is distinctly lower, i.e. the dispersion is higher than in bottom fields; 

• the distributions of motion coefficients of top fields and of bottom fields belonging to interlaced sequences have a 
substantially similar shape and the dispersion of said distributions is greater the greater is motion present. 

20 [0039] Substantially the same significative shapes are obtained even for a single frame, besides for a whole se- 
quence. 

[0040] Substantially, an interlaced picture is substantially characterized in that the distribution of top motion coeffi- 
cients is almost identical to that of bottom motion coefficients, while a progressive picture is characterized by a notable 
difference between distributions of top and bottom motion coefficients and by a relatively stronger clustering near the 
25 maximum value of the bottom motion coefficients than of the top motion coefficients. 

[0041] The detection of one of the above mentioned characteristics is exploited for recognizing the progressive or 
the interlaced content of a picture. 

[0042] The stochastic variables theory gives several methods for recognizing one of the above mentioned properties. 
Such methods may consist in calculating a pair of shape coefficients representing the distribution of the top and of the 

30 bottom motion coefficients and in verifying whether between the two shape coefficients there is a substantial difference 
or not: in the positive case, the two distributions are recognized as similar, otherwise as different from each other. 
[0043] Alternatively it is possible to calculate a coefficient representative of the correlation of the two distributions 
and to verify whether such a correlation coefficient exceeds a certain value or not: in the positive case the picture is 
interlaced, otherwise progressive. 

35 [0044] Yet another method may consist in using as a shape coefficient the variance in respect to the maximum value 
of the two distributions of motion coefficients. Given that for a progressive picture the values of motion coefficients of 
the bottom field are more clustered near their maximum than the values of motion coefficients a top field, the variance 
for the top field will be greater than the variance for the bottom field. Therefore, it may be said that if the variance of 
the motion coefficients of the top field exceeds by a certain percentage the variance of the motion coefficients of the 

40 bottom field, the picture is progressive, otherwise interlaced. 

[0045] A particularly interesting application of the invention for performing a pre-test of the type of pictures, recog- 
nizing a picture as progressive by way of the motion coefficients is the one described hereinbelow. 
[0046] Sometimes in video sequences there are pictures, called "quasi -stationary", whose motion is limited to rela- 
tively small portions of the frame: a typical quasi-stationary picture is a picture representing a TV news speaker, wherein 

45 the motion is substantially limited to the face of the speaker while the remaining portion of the picture remains sub- 
stantially static. 

[0047] In such a situation there is a relatively small difference between a progressive and an interlaced picture, 
therefore, without introducing relevant errors it is possible to classify the examined picture as convenience. Because 
a most important application is for video sequences coded according to the MPEG standard, the quasi-stationary 
so pictures may be considered progressive because such a choice allows a considerable reduction of the number of 
calculations needed for the MPEG coding. 

[0048] The recognition of pictures as "quasi-stationary' 1 pictures is carried out by producing two sum coefficients, 
respectively top SV TOP and bottom SV BOT , as the sum of all top and bottom motion coefficients (represented by the 
signal B of the "quasi-stationary" block of the schemes of Fig. 6a and 6c), respectively. If both sum coefficients SV TOP 
55 and SV BOT are lower than respective pre-established positive numbers T1 and T2, that is: 
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s v top < T1 and SV BOT < T2 (1 ) 

then the examined picture will be recognized as a "quasi -static nary" picture. 

[0049] Given the great simplification resulting from the possibility of treating pictures as "quasi-stationary" pictures, 
it is possible to conveniently use the above described test before any subsequent test for detecting eventually the 
progressive or Interlaced content of a picture. 

[0050] A preferred way of discriminating progressive pictures from interlaced pictures consists in defining two pre- 
established threshold, respectively bottom THR1 and top THR2, and comparing the top motion coefficients V TOPX Y 
(K) and bottom motion coefficients V BOTX(Y (K) with these thresholds, indicating with 

N lTOP(K) = number of all MVtop x , y (k) such that Vtop x , y (K)<THR1 
N2top(K) = number of all MVtop x , y (k) such that Vtop x . y (k)>THR2 (2) 
NlBOT(K) = number of all MVbot x>y (k) such that Vbot X|Y (k)<THR1 
N2BOT(FC) = number of all MVbot x>y (k) such that Vbot XjY (k)>THR2 

[0051] An example of a possible functional block diagram for calculating the parameter N1 TOP (K) of a top field is 
shown in Fig. 5. The architecture for calculating the other three parameters is the same. 

[0052] Referring to the hystograms of Figs. 3 and 4, it may be noted that N1 TOP (K) and N1 BOT (K) give an indication 
of the maximum value of the distributions (if THR1 is greater than such a maximum value), while N2 TOP (K) and N2 BOT 
(K) indicate their dispersion (width of the dome-shaped curve envelopping the hystograms). 
[0053] Let us define the following ratios: 

R1(K)= NlTOP(K) /NlBOT(K-l) 

R2(K)=NlBOT(K)/NlTOP(K) (3) 
R3(k)= N2top(k) /.N2bot(k-1) 

R4(ft)= N2BOT(K) / N2TOP(K) 
[0054] Calling y, 5, e and t\ four pre-established numbers, if all the following tests verify: 

Rl(K)<y 

R2(K)>5 (4) 
R3(k) > c 
R4(K) < ti 
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then the /c-th frame is recognized as progressive, otherwise it is recognized as interlaced. 

[0055] The choice of using all four parameters, instead of using only R2(K) and R4(K) for example, is due to the fact 
that because a decision must be taken for each frame, in order to minimize uncertainty it is preferable to use all available 
information. In order to prevent errors, it is also possible to apply the following conditions: 

5 

R5(K) = R2(K) / Rl (K) > 0 (5) 

10 R6(K) = R3(K) / R4(K) > i 

being 6 and i two pre-established positive numbers. The latter conditions impose a more distinguishable difference 
between the two distributions for classifying a picture as a progressive one. 

15 [0056] An advantage of the above described procedures consists in the possibility of operating on motion vectors 
established for macrobiocks rather than exclusively for single pixels. This permits an evident simplification of calcula- 
tions that makes the method of the invention particularly appropriate even in low cost applications. 
[0057] In orderto avoid oscillations in the results provided by the procedure of the invention for detecting the interlaced 
or progressive content of the picture, that could be caused by short portions of a video sequence wherein the motion 

20 is such to make the discrimination very difficult, a simple hysteresis mechanism that considers the decisions taken in 
preceding frames may be introduced. 

[0058] A*' hysteresis mechanism may be realized by associating to each picture a temporary weight value P(K) de- 
pending on the recognition of the picture as a progressive or an interlaced picture, according to one of the above 
described methods of the invention, calculating for each processed picture a final weight value D(K) as a function of 

25 its temporary weight value P(K) and of the final weight values D(l) relative to one or more preceding pictures; recognition 
of the current picture as a progressive or interlaced picture being based on its final said weight value. 
[0059] An example of how such a hysteresis algorithm may be realized is shown in the flow sheet of Fig. 6a. 
[0060] First of all the number of previous pictures to be considered must be decided by attributing a value to the 
parameter NF, for example NF=3. Afterwards the system applies an arbitrary technique for estimating the motion vectors 

30 and calculates the motion coefficients V TOPX Y (K) and V BOTX Y (K). Starting from the values of these coefficients, the 
block QUASI-STATIONARY verifies whether the picture is quasi-stationary or not, according to the above described 
algorithm. 

[0061] In the positive case, the picture is classified as progressive, its final weight is established D(K)=a> and a 
successive picture is processed. 
35 [0062] In the negative case, the system calculates the ratios R1 (K), R2(K), R3(K), R4(K), R5(K) and R6(K) according 
to equations (3) and (5). If these ratios satisfy the inequalities (4) and (5), the block HYSTERESIS, described in more 
in detail in Fig. 6c, associates a temporary weight P(K) to the picture. Such a temporary weight is equal to a number 
p if the inequalities (4) and (5) are satisfied, otherwise to a number a. 

[0063] When the temporary weight is known, the block HYSTERESIS recognizes the current frame as a progressive 
40 frame if the sum of weights associated to the last two frames and to the current frame is greater than or equal to a 
certain number u., i.e.: 

P(K) + D(K-1) + D(K-2)>= u, 

45 

and the frame gets a final weight D(K)=p. In the contrary case, the current frame gets a final weight D(K)=co, is classified 
as interlaced. 

[0064] If equations (4) and (5) are not verified (input N of the "Hysteresis" block in Figs. 6a and 6b), the current frame 
k is temporarily classified as interlaced and assumes the temporary weight P(K)=ct. If the complementary condition is 
so verified: 

P(K) + D(K-1) + D(K-2) <= u. 

55 the final choice is interlaced frame with a final weight D(K)=a; in the contrary case, the frame is classified as progressive 
and assumes the final weight D(K)=co. 

[0065] According to a preferred embodiment of the invention, the weight a, (3, u, and co are equal to 0, 1 , 1 .5, 0.5, but 
these values may be changed depending on the needs, even during the same picture sequence. 
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[0066] With such a hysteresis procedure, the effective choice made in previous frames, by means of the weights D 
(K-1) and D(K-2), and the indication given by the various decision parameters R1(K), R2(K), R3(K), R4(K) are consid- 
ered, without giving a priority to any one of the two possible choices. 

[0067] A block diagram of an embodiment of the method of the invention is depicted in Fig. 7. Starting from an 
estimation of the motion vectors, it is verified if the current picture is quasi-stationary, then the algorithm of the invention 
(IPD) is executed and finally afunctional block, implementing the hysteresis procedure, produce a flag indicating wheth- 
er the picture is progressive or interlaced. 

[0068] Using the hysteresis procedure, an abrupt change from the interlaced to the progressive scanning mode is 
prevented, thus avoiding that short lasting disturbances (noise) may affect the decision. 

[0069] The method of the invention allows for the calculation of the motion vectors in a most refined manner, by 
adaptively choosing the calculation algorithm that implies the lowest number of calculations. 

[0070] Given the importance assumed by the MPEG2 standard in treating sequences of digital pictures, in the fol- 
lowing it will be made reference to a MPEG2 system, although the same considerations will be similarly applicable, 
mutatis mutandis, even in systems based on different standards. 

[0071] In the MPEG2 standard 2 , as in other standards : at least two different techniques of motion estimation and 
consequent temporal prediction are possible: 

• Frame Prediction, wherein the current picture is divided in macroblocks of frame kind (1 6x1 6 pixels) and for each 
of them the predictor in the preceding frame is found with an arbitrary method for motion estimation. 

• Field Prediction, wherein each macroblock frame is divided in its two component fields (16x8 pixels), one on the 
even half-frame and one on the odd half-frame and for each of them the predictor on the field of the same parity 
belonging to the temporarily preceding picture is found. 

[0072] The best coding is obtained by testing ail possible combinations of predictors of a macroblock: frame, field 
top k on field top k-1, field top /con field bottom k-1, field bottom k on field top k-1, field bottom /con field bottom k-1. 
In all cases, the prediction (and the motion estimation) can be of Forward or Backward kind. Therefore, a great number 
of matching errors need to be calculated for each macroblock. 

[0073] This becomes burdensome for low cost applications: in fact interlaced sequences with little motion may be 
coded as progressive, without any relevant loss of quality, thus avoiding a burdening Field Prediction. Similarly, an 
interlaced sequence with considerable motion would not have any advantage of being predicted as a Frame, and they 
could satisfactorily require only a Field Prediction estimation. 

[0074] It is evident the great computational simplification (from about 30% to about 50%), obtainable by carrying out 
only a Field or a Frame estimation. It would be necessary just a mechanism for telling to the motion estimator when 
to operate in Field mode and when in Frame mode. This is excellently provided by the IPD algorithm of the present 
invention. The only requirement is that the motion estimation be carried out on consecutive fields in one or more steps. 
[0075] The first step, carried out by the block COARSE_SEARCH depicted in Fig. 8, is necessary to calculate coarse 
estimations of motion vectors from successive frames. The second step, carried out by the block FINE_SEARCH, is 
necessary, to calculate a finer estimation of the motion vectors needed to the effective coding. At the end of the first 
step it is possible to carry out the IPD procedure of the invention, to implement a fine motion estimation in the second 
step. 

[0076] Finally, the IPD method of the invention, when associated to a method for motion estimation, provides for 
alternative advantages as compared to the use of the motion estimator without any IPD modulus of the invention: 

• a better quality of the predicted pictures for the same maximum number of calculated matchings per macroblock, 
because it is possible to increase the number of Frame only or Field only predictors (up to certain maximum number) 
by choosing the prediction mode depending on the indication provided by the IPD modulus; 

• a lower number of matchings per macroblock for the same quality of predicted pictures, because either a Field 
prediction or a Frame prediction is carried out, depending on the indications provided by the IPD modulus, saving 
from 30% to 50% of the number of matchings, depending on the motion estimation mode. 



Claims 

1 . A method of processing a bitstream of coded data of video sequences of progressive or interlaced pictures divisible 
in a top half-frame and in a bottom half -frame that comprises estimating motion vectors of groups of pixels belonging 
to said top half-frame of the current picture in relation to pixels belonging to the bottom half-frame of the preceding 
picture (MV TOPX Y (K)) and motion vectors of group of pixels of said bottom half -frame of the current picture in 

2 ISO/IEC 13818-2, Draft International Standard, May 1994. 
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relation to pixels belonging to the top half-frame of the current picture (MV BOTX Y (K)), respectively, characterized 
in that it comprises 

calculating for each macroblock of said top half-frame and a bottom half-frame a respective top motion coef- 
ficient (V TOPX Y (K)) and a bottom motion coefficient (V BOTX Y (K)), depending on the estimation of the motion 
vectors of top half-frame (MV TOPX Y (K)) and bottom half-frame (MV BOTX Y (K)); 

recognizing the current picture as an interlaced picture by a substantial equality of the distributions of values 
of said motion coefficients (V TO p X|Y (K), V BOTX|Y (K)) or as a progressive picture by a substantial inequality of 
the distributions of values of said motion coefficients (V TOPX Y (K), V Borx Y (K)). 

The method of claim 1 wherein said recognition is done by the following operations: 

comparing said top motion coefficients (V TOPXY (K)) and bottom motion coefficients (V BOTXY (K)) with a pre- 
established top threshold (THR1) and a bottom threshold (THR2), respectively; 

counting the number of motion vectors, whose motion coefficients are lower than said top threshold (THR1), 
of said top half-frames (MV TOPX Y (K)) and bottom half-frames (V BOTX Y (K)) of the current picture producing a 
pair of coefficients first (N1 TOP (K)) and second (N1 BOT (K)); 

counting the number of motion vectors, whose motion coefficients are greater than said bottom threshold 
(THR2). of said top half-frames (MVjq PX Y (K)) and bottom half-frames (V BOTX Y (K)) of the current picture pro- 
ducing a second pair of coefficients third (N2 TOP (K)) and fourth (N2 BOT (K)); 

recognizing the current picture as progressive or interlaced picture depending on said coefficients first (N1 TOP 
(K)), second (N1 Ba r( K ))> third (N2 TOP (K)) and fourth (N2 BOT (K)) relative to the current picture and to preceding 
pictures. 

The method of claim 1 wherein the calculated values of said top and bottom motion coefficients are used to carry 
out a preliminary test consisting in 

summing the motion coefficients of macroblocks belonging to thetop half-frame of the current picture producing 
a top sum coefficient (SV TDP (K)); 

summing the motion coefficients of macroblocks belonging to the bottom half-frame of the current picture 
producing a bottom sum coefficient (SV TOP (K)); 

defining the current picture as a progressive picture if said top sum coefficient (SV TOP (K)) and bottom sum 
coefficient (SV BOT (K)) are lower than respective pre-established positive numbers first (T1) and second (T2), 
otherwise proceeding with the sequence of operations of claim 2. 

The method of claim 1 wherein said recognition is carried out by 

calculating a pair of shape coefficients first and second representing distributions of said top and bottom motion 
coefficients, respectively; 

recognizing the current picture as a progressive picture or as an interlaced picture whether said shape coef- 
ficients differ by a quantity greater or lower than a certain value, respectively. 

The method of claim 1 wherein said recognition is carried out by 

calculating a coefficient representing the stochastic correlation between distribution of said top and bottom 
motion coefficients, 

recognizing the current picture as an interlaced picture or as a progressive picture whether said calculated 
coefficient exceeds a certain value or not. 

The method of claims 2 or 3 characterized in that It comprises 

calculating a first ratio (R1 ( K )) between said first coefficient relative to the current picture (N1 TOP ( K )) and the 
second coefficient relative to a preceding picture (N1 BOT ( K J)); 

calculating a second ratio (R2( K )) between said second coefficient relative to the current picture (N1 BOT (K)) 
and said first coefficient relative to the current picture (N1 TOP (K)); 

calculating a third ratio (R3(K)) between said third coefficient relative to the current picture (N3r OP (K)) and the 
fourth coefficient relative to a preceding picture (N2 BOT ( K -1)); 

calculating a fourth ratio (R4(K)) between said fourth coefficient relative to the current picture (N2 BOT (K)) and 
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said third coefficient relative to the current picture (N2 TOP (K)); 

comparing said ratios first (R1(K)), second (R2(K)), third (R3(K)) and fourth (R4(K)) with respective pre-es- 
tablished positive numbers third (y), fourth (5), fifth (e) and sixth (ti) recognizing the current picture as a pro- 
gressive picture if said ratios first (R1 (K)) and fourth (R4(K)) are lower than said numbers third (y) and sixth 
5 (t|), respectively, and if simultaneously said ratios second (R2(K)) and third (R3(K)) are greater than said 

numbers fourth (5) and fifth (e), respectively. 

7. The method of claim 6 characterized in that it comprises 

10 dividing said ratios second (R2(K)) and first (R1 (K)) between them producing a fifth ratio (R5(K)); 

dividing said ratios third (R3(K)) and fourth (R4(K)) between them producing a sixth ratio (R6(K)); 
recognizing the current picture as a progressive picture if requirements of the method of claim 6 are satisfied 
and if simultaneously said ratios fifth (R5(K)) and sixth (R6(K)) are greater than pre-established numbers 
seventh (0) and eighth (x). 

8. The method according to one of the preceding claims characterized in that it comprises 

for each elaborated picture, calculating a temporary weight value (P(K)) as a function of the result of the 
recognition of the current picture as a progressive or interlaced picture carried out according to one of the 
20 methods of the preceding claims; 

for each elaborated picture, calculating a fin al weight value (D(K)) as a function of said temporary weight value 
relative to the current picture (P(K)) and of final weight values relative to preceding pictures (D(l)); 
recognizing the current picture as a progressive or an interlaced picture depending on said temporary weight 
value relative to the current picture (P(K)) and on said final weight values relative to preceding pictures (D(l)). 

25 

9. The method of claim 1 comprising the calculation of motion vectors of a picture of a video sequence by 
calculating said motion vectors by a Frame- Prediction technique if the current picture is recognized as a progressive 
picture or by a Field-Prediction technique if the current picture is recognized as an interlaced picture. 
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